Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification
Abstract
RGB-infrared person re-identification is an emerging cross-modality re-identification task, which is very challenging due to the significant modality discrepancy between RGB and infrared images. In this work, we propose a novel modality-adaptive mixup and invariant decomposition (MID) approach for RGB-infrared person re-identification, aimed at learning modality-invariant and discriminative representations. MID designs a modality-adaptive mixup scheme to generate suitable mixed-modality images between RGB and infrared images, mitigating the inherent modality discrepancy at the pixel level. It formulates the modality mixup procedure as a Markov decision process, where an actor-critic agent learns a dynamical and local linear interpolation policy between different regions of cross-modality images under a deep reinforcement learning framework. Such a policy guarantees modality invariance in a more continuous latent space and avoids manifold intrusion by corrupted mixed-modality samples. Moreover, to further counter the modality discrepancy and enforce invariant visual semantics at the feature level, MID employs modality-adaptive convolution decomposition to disassemble a regular convolution layer into modality-specific basis layers and a modality-shared coefficient layer. Extensive experimental results on two challenging benchmarks demonstrate the superior performance of MID over state-of-the-art methods.
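As a rough illustration of the "local linear interpolation between different regions" mentioned in the abstract, the sketch below blends an RGB image and an infrared image stripe by stripe with a separate weight per stripe. It is only a minimal sketch under stated assumptions, not the paper's implementation: the function name `regionwise_mixup`, the use of horizontal stripes as regions, and the externally supplied `ratios` are all hypothetical, and the actor-critic agent that would choose the weights in MID is not modelled. The infrared image is assumed to be already resized and replicated to the same shape as the RGB image.

```python
import numpy as np


def regionwise_mixup(rgb, ir, ratios):
    """Blend two aligned images region by region (illustrative only).

    rgb, ir : float arrays of shape (H, W, C) with identical shapes.
    ratios  : one interpolation weight in [0, 1] per horizontal stripe.
              Stripes are a hypothetical choice of "region"; in MID the
              weights would come from a learned policy, here they are given.
    """
    rgb = np.asarray(rgb, dtype=float)
    ir = np.asarray(ir, dtype=float)
    if rgb.shape != ir.shape:
        raise ValueError("rgb and ir must have the same shape")

    ratios = np.clip(np.asarray(ratios, dtype=float), 0.0, 1.0)
    height = rgb.shape[0]
    # Split the rows into len(ratios) contiguous stripes.
    bounds = np.linspace(0, height, len(ratios) + 1).astype(int)

    mixed = np.empty_like(rgb)
    for lam, top, bottom in zip(ratios, bounds[:-1], bounds[1:]):
        # Linear interpolation inside this stripe: lam * RGB + (1 - lam) * IR.
        mixed[top:bottom] = lam * rgb[top:bottom] + (1.0 - lam) * ir[top:bottom]
    return mixed


# Usage: three stripes, from pure RGB at the top to pure infrared at the bottom.
rgb_img = np.ones((6, 4, 3))
ir_img = np.zeros((6, 4, 3))
mixed_img = regionwise_mixup(rgb_img, ir_img, [1.0, 0.5, 0.0])
```

A weight of 1 keeps the RGB stripe unchanged and a weight of 0 keeps the infrared stripe, so intermediate weights give samples lying between the two modalities.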
Similar resources
Supplementary Material for “RGB-Infrared Cross-Modality Person Re-Identification”
This supplementary material accompanies the paper “RGB-Infrared Cross-Modality Person Re-Identification”. It includes more details of Section 4, as well as extra evaluations of our proposed deep zero-padding method. 1. Details of Counting Domain-Specific Nodes In the third paragraph of Section 4.2 in the main manuscript, we quantify the number of domain-specific nodes in the trained network in ...
Query Based Adaptive Re-ranking for Person Re-identification
Existing algorithms for person re-identification hardly model query variations across non-overlapping cameras. In this paper, we propose a query based adaptive re-ranking method to address this important issue. In our work, negative image pairs can be easily generated for each query under non-overlapping cameras. To infer query variations across cameras, nearest neighbors of the query positive ...
Pose Invariant Embedding for Deep Person Re-identification
Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With bad alignment, the background noise will significantly compromise the feature learning and matching process. To address this problem, this paper introduces the pose invariant embedding (PIE) as a pedestrian descriptor. First,...
Hierarchical Invariant Feature Learning with Marginalization for Person Re-Identification
This paper addresses the problem of matching pedestrians across multiple camera views, known as person re-identification. Variations in lighting conditions, environment and pose changes across camera views make re-identification a challenging problem. Previous methods address these challenges by designing specific features or by learning a distance function. We propose a hierarchical feature le...
View-Adaptive Metric Learning for Multi-view Person Re-identification
Person re-identification is a challenging problem due to drastic variations in viewpoint, illumination and pose. Most previous works on metric learning learn a global distance metric to handle those variations. Different from them, we propose a view-adaptive metric learning (VAML) method, which adopts different metrics adaptively for different image pairs under varying views. Specifically, give...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2022
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v36i1.19987